Abstract: In this paper we present a multi-user cooperative protocol for wireless networks. Two sources transmit their information blocks simultaneously, and the relays opportunistically employ successive interference cancellation (SIC) in an effort to decode them. An adaptive decode/amplify-and-forward scheme is applied at the relays to the decoded blocks, or to their sufficient statistic if decoding fails. The main feature of the protocol is that SIC is exploited in a network, since more opportunities arise for each block to be decoded as the number of used relays $N_{RU}$ is increased. This feature leads to benefits in terms of diversity and multiplexing gains that are proven with the help of an analytical outage model and a diversity-multiplexing tradeoff (DMT) analysis. The performance improvements are achieved without any network synchronization and coordination. In the final part of this work the closed-form outage probability model is used by a novel approach for offline pre-selection of the $N_{RU}$ relays that have the best SIC performance from a larger number of $N_R$ nodes. The analytical results are corroborated with extensive simulations, while the protocol is compared with orthogonal and multi-user protocols reported in the literature.

Index Terms: Successive interference cancellation, cooperative protocol, multi-user communication, dense wireless networks, distributed space-time coding, multiplexing gain.


I. Introduction 

One recurring research theme in the area of wireless networks has been the optimal use of every node for the benefit of the complete network. The cooperative communication paradigm is one such approach, where relay nodes forward signals between other nodes to improve their performance m, n. The main benefit is a diversity gain that is essential for improving performance in slow fading channels. When multiple relays are available, they can create a distributed antenna system that mimics the behavior of the classic multi-antenna MIMO systems, which accomplish the same with space-time coding (STC) lU, @1. In this case the relays receive a signal from a source, and then they can construct a distributed space-time code (DSTC) Q-H). The first works on DSTC considered relays that decode the received signals and then create the DSTC based on this result Q-Q. Besides this decode-and-forward (DF) DSTC strategy (referred to as DSTC/DF), more general DSTCs for amplify-and-forward (AF) relays were introduced in i). The work in fSl allows a single transmitter to broadcast an information block in one slot, while in the subsequent slot the relays encode their received signals into a distributed linear dispersion (LD) code. One key characteristic of this work is that the relays do not decode the received signals, hence the name DSTC/AF. Overall, DSTC leads to a higher receiver SNR (diversity gain), but all forms of STC also include a multiplexing gain. Their relative interplay is captured with the diversity-multiplexing tradeoff (DMT) M-. Higher spectral efficiency (SE) is possible since the higher SNR can be combined with higher order modulation and coding schemes (MCSs). However, in a two-hop network, the multiplexing gain that can be achieved with one source is equal to 1/2 Is). Hence, the classic relay-based systems are limited in terms of the potential multiplexing gain.

Fig. 1. (a) The network model in this paper consists of a multiple access multi-relay network (MAMRN). In this figure the MAMRN is mapped to a downlink transmission scenario. (b) Cooperative protocol transmissions in the time domain.


The natural question is whether one can exploit nodes that act as relays in a multi-user network with non-orthogonal transmission protocols that can lead to higher multiplexing gains. The relay network model in this case consists of a cascaded compound multiple access channel (C-MAC) in the first hop (the message from each source needs to be decoded at every relay uni), and a MAC in the second hop (Fig. 1(a)). This is also referred to as a multiple access multi-relay network (MAMRN). This setup has many applications in cellular in-band relay-based uplink and downlink (Fig. 1(a)) ifTTIl, relay-based downlink multicast communication ca (also in Fig. 1(a) with the addition of D2), and cellular device-to-device (D2D) communication where the relays can be users themselves Ha.

Multi-source communication in this MAMRN can be handled efficiently with network MIMO techniques m. DF and AF relay protocols for uplink cellular communication were analyzed in [14]-[16]. In ifTSll it was shown that DF relays for the MAMRN perform very well in terms of outage for slow fading channels in the low SNR regime, while AF performs better in the high SNR regime due to the lack of noise amplification. The work in ca is essentially the implementation of the Han and Kobayashi DF scheme but for a cellular MAMRN. A third compress-and-forward (CF) scheme, originally proposed in ini, M, was shown to be better than AF and DF for the MAMRN in [19], under the assumptions of a non-fading channel, full-duplex relay, and multicell processing (in our scenario two destinations would be needed to communicate through a backhaul link). Hence, this CF scheme cannot be implemented directly in our setup since it requires side-information from a second BS. Another important result in mi was that when the relay power is the same as the source power, CF performs slightly worse than AF even under multicell processing. More recently, the authors in IMIl considered a MAMRN with the use of structured lattice codes at the sources and AWGN links on the second hop.

Although the topology we consider uses two hops, several works have focused on multi-user communication in single-hop networks, usually with the presence of a single relay, but also without one. The first scenario is typically referred to as the multiple access relay channel (MARC). Some of the protocols we discuss below can be adapted to operate in two-hop networks. These protocols are similarly categorized as DF/AF. Overall, in a MARC higher multiplexing gains can be achieved as shown in ED-Ea. In [24] the authors proposed complex field network coding (CFNC) for the MARC. The relay uses DF and symbol-level network coding over a complex field for the symbols originating from the two sources. This protocol requires the pre-distribution of the coefficients in the complete network. Furthermore, in the same work extensions for multiple relays and sources are provided, while the protocol is inherently engineered to support a single destination. Another recent work reported in [23] proposed a DF-based DSTC protocol for a single-hop network that consisted of two sources and one receiver without the presence of a relay. The protocol achieves a multiplexing gain equal to 2/3 for a half-duplex system, and 1 for a full-duplex system. This last work will serve as an interesting benchmark for our system. Regarding AF-based protocols, the authors in mi proposed an analog network coding (ANC) based protocol for the multicast MARC channel. For the MARC with a single destination, the authors in [25] generalized the concept of ANC with relays that execute ANC in a complex field (i.e., similar to CFNC). A transformation operation is executed at the relay based on feedback received from the destination. In the case of a multi-relay network the extension is not directly obvious (hence it is difficult to collect full diversity), while the increased rate of transmitter channel state information (CSIT) needed at the relays might be a problem similar to Ell. In im, [22] AF-based DSTCs with a special non-orthogonal structure for the two-user MARC were proposed.

In this paper we propose a multi-user cooperative protocol for the MAMRN in Fig. 1(a). We allow the two sources to transmit simultaneously without any coordination with the remaining network (e.g., precoding, beamforming), or precise symbol/bit/packet-level synchronization. Interfering signals are exploited in the MAMRN by attempting to decode at each relay both simultaneously transmitted information blocks. This is accomplished by using the optimal decoding strategy at each relay for this case, which is successive interference cancellation (SIC) ESj. Next, we apply an adaptive DF/AF protocol both on the decoded blocks and on the non-decoded blocks. The sources and relays do not require any form of CSIT, while the communication model in the second hop can be either unicast or multicast ca. We rigorously model the performance of the proposed protocol, whose preliminary version was first reported in ifZTll. Our transmission protocol is engineered with a mindset towards simplicity and full exploitation of potential relays without performance compromise.

Contributions/Main results. The contributions of this work are: 1) A novel multi-user protocol design for relay networks. In the high SNR regime our DMT analysis, corroborated with simulation, reveals that our protocol achieves a multiplexing gain of 2/3 that is typically possible with one-hop DSTC systems [23], EH-. The above is true even in strong interference conditions between the two sources. In the low SNR regime our protocol outperforms classic orthogonal DSTCs and multi-user protocols. 2) A rigorous outage analysis for the proposed protocol that jointly considers interfering transmissions, SIC decoding, and the AF/DF relay operations. The performance analysis constitutes on its own a contribution of this paper since it characterizes the performance of multirate transmitters and SIC decoding. The overall model is used first for shedding light on the inner workings of our protocol, and second for the DMT analysis, while it is validated with simulations. 3) We also work towards extending our idea to more general networks with multiple relays. In particular we propose a relay pre-selection technique that uses our outage model for selecting nodes statically as suitable relay candidates based on their ability to decode with SIC. The ramifications of the last idea are important since it provides a path for a power-efficient and low-complexity use of nodes in dense wireless networks.

Paper Organization. The rest of the paper is organized as follows. The system model and a detailed description of the proposed cooperative protocol can be found in Section II. The outage performance analysis for the protocol is presented in Section III, while the DMT analysis is in Section IV. The use of our model for optimization in relay networks is presented in Section V. Section VI presents simulation and numerical results, while Section VII presents our conclusions.

II. Proposed Cooperative System 
A. System Model 

We consider a relay network model, depicted in Fig. 1(a), with two sources S1 and S2, and $N_R$ relays denoted as RS1, RS2, ..., RS$N_R$. We examine a many-to-one communication scenario (S1,S2→D1) in this paper. However, our protocol and our analysis are applicable, without any modification, to a unicast scenario (S1→D1, S2→D2), and also to multicast (S1,S2→D1,D2,...). All nodes are equipped with a single omni-directional antenna that can be used both for transmission and reception in half-duplex mode, while all nodes have the same average power constraint. We denote the independent channels from the $s$-th source to the $r$-th relay as $h_{s,r}$, and the channel from the $r$-th relay to the destination as $f_r$. These channels are block-fading (quasi-stationary) and Rayleigh, i.e., $h_{s,r} \sim \mathcal{CN}(0,1)$, $f_r \sim \mathcal{CN}(0,1)$ (complex Gaussian random variables with zero mean and unit variance). AWGN with zero mean and variance $\sigma^2$ is assumed at all receivers. Receiver CSI is needed for SIC decoding and for calculating the power scaling at the relays. No further channel knowledge is required. CSI at the final destination is similarly assumed to be available from training signals or packet preambles Q, El, EH-.

Regarding the paper notation, matrices are denoted with bold capital letters while bold lowercase letters denote vectors. Also, $E[\cdot]$ is the expectation of a random variable.

B. Cooperative Protocol 

Since we assume that there is no direct link between the sources and the destination, three time slots are required for the transmission of two blocks (Fig. 1(b)). In the first slot the sources simultaneously broadcast their blocks, denoted as $x_1$ and $x_2$ respectively. The communication rates of the sources are $R_1$ and $R_2$ bits/symbol respectively, and they take into account both modulation and coding ESI. The power dedicated to each transmitted block from the sources is normalized to unity, i.e., $E[|x_1|^2]=E[|x_2|^2]=1$. Thus, the baseband model for the interfering blocks at relay $r$ is $y_r = h_{1,r}x_1 + h_{2,r}x_2 + w_r$, where $w_r$ is the AWGN sample at relay $r$.

After the first broadcast phase of the protocol is completed, each relay decodes the two blocks by employing ordered SIC. That is, the block with the highest energy/bit is decoded first while the other block is treated as noise, and no precise synchronization is needed El. To understand why the energy/bit is used for selecting the decoding order, consider that there is no interference. Then, the probability of symbol error for MQAM can be approximated by $4Q\left(\sqrt{\frac{3|h_{1,r}|^2}{(2^{R_1}-1)\sigma^2}}\right)$ [29].

The fractional term inside the Q function is essentially the normalized SNR/bit. We can get a similar expression for the second source. Since we assumed $E[|x_1|^2]=E[|x_2|^2]=1$, the following condition is checked so that $x_1$ is decoded first:¹

$$\frac{|h_{1,r}|^2}{2^{R_1}-1} \geq \frac{|h_{2,r}|^2}{2^{R_2}-1} \tag{1}$$

If $x_1$ is correctly decoded, it is subtracted from the aggregate signal $y_r$. The cancellation mechanism is executed at the block level. Thus, upon successful decoding, and with CSI at the relay (in this example $h_{1,r}$), we can completely cancel a complete block from the aggregate received signal $y_r$.
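The ordered SIC step at a single relay can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the authors' implementation: the function name `sic_decode_relay`, the capacity-based decoding test (a block is taken as decodable when its SINR is at least $2^{R}-1$, i.e., Gaussian codebooks), and the example channel values are all introduced here. The decoding order follows condition (1), and a failure of the first block prevents any attempt on the second one.

```python
def sic_decode_relay(h1, h2, R1, R2, sigma2):
    """Return the set of sources decoded by ordered SIC at one relay.

    Assumption (not taken from the paper): a block is decodable when
    log2(1 + SINR) >= R, i.e. SINR >= 2**R - 1.
    """
    Y, X = abs(h1) ** 2, abs(h2) ** 2      # received powers of x1 and x2
    t1, t2 = 2 ** R1 - 1, 2 ** R2 - 1      # SNR thresholds of the two rates
    decoded = set()
    if Y / t1 >= X / t2:                   # condition (1): x1 is decoded first
        if Y / (X + sigma2) >= t1:         # x2 is treated as noise
            decoded.add(1)
            if X / sigma2 >= t2:           # x1 was cancelled, only noise remains
                decoded.add(2)
    else:                                  # x2 has the higher energy/bit
        if X / (Y + sigma2) >= t2:         # x1 is treated as noise
            decoded.add(2)
            if Y / sigma2 >= t1:           # x2 was cancelled, only noise remains
                decoded.add(1)
    return decoded


# Hypothetical channel realizations with R1 = R2 = 1 bit/symbol, sigma^2 = 0.1
print(sic_decode_relay(3.0, 1.0, 1, 1, 0.1))
print(sic_decode_relay(1.0, 1.0, 1, 1, 0.1))
```

With equal received powers the first block already fails, which illustrates why the protocol benefits from additional relays with different channel realizations.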

Depending on the result, the $r$-th relay will transmit different signals in the $r$-th time slot of the protocol (Fig. 1(b)). To denote the signals that the relays transmit we use the notation $q_r$. This relay pre-processing can be compactly modeled as:

$$q_r = a_{r,1}x_1 + a_{r,2}x_2 + a_{r,3}w_r \tag{2}$$

The adopted signal notation covers every possible block decoding outcome at the relay through the complex parameters $a_{r,1}, a_{r,2}, a_{r,3}$ defined in Table I. In case both blocks are decoded at both relays, then $q_r$ effectively contains the superposition of the blocks $x_1, x_2$. Based on the previous discussion we see that our protocol applies an adaptive AF/DF protocol for signal $q_r$ depending on its content (decoded signal or not). In matrix notation the transmitted signal from all the relays is:

$$\mathbf{Z} = \mathrm{diag}(g_1q_1,\; g_2q_2,\; \ldots,\; g_rq_r,\; \ldots)$$

In the above matrix the rows indicate the relay and the columns the time slot, while $g_r$ scales the transmit power of signal $q_r$. Now, after scaling is applied, the relays broadcast their blocks in $\mathbf{Z}$ sequentially. A total of $N_P = N_R + 1$ slots are needed.

¹It is possible that different rules are used for selecting the symbol to be decoded first, or even that a completely different IC scheme is adopted.

TABLE I
Signal Modeling at the Relays.

decoded?  | x_1, x_2 | x_1      | x_2      | none
a_{r,1}   | 1        | 1        | h_{1,r}  | h_{1,r}
a_{r,2}   | 1        | h_{2,r}  | 1        | h_{2,r}
a_{r,3}   | 0        | 1        | 1        | 1
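The coefficient assignment of Table I and the relay output of equation (2) can be written as a small lookup. The sketch below is illustrative only: the function names `relay_coefficients` and `relay_signal` and the representation of the SIC outcome as a Python set are assumptions made here.

```python
def relay_coefficients(decoded, h1, h2):
    """Map a relay's SIC outcome to (a_r1, a_r2, a_r3) following Table I.

    decoded is the set of decoded sources: {1, 2}, {1}, {2} or set().
    """
    a1 = 1 if 1 in decoded else h1      # decoded block is forwarded clean (DF), else as received (AF)
    a2 = 1 if 2 in decoded else h2
    a3 = 0 if decoded == {1, 2} else 1  # relay noise is forwarded unless both blocks were decoded
    return a1, a2, a3


def relay_signal(decoded, h1, h2, x1, x2, w):
    """Relay output q_r = a_r1*x1 + a_r2*x2 + a_r3*w of equation (2)."""
    a1, a2, a3 = relay_coefficients(decoded, h1, h2)
    return a1 * x1 + a2 * x2 + a3 * w
```

Since there are only four outcomes per relay, the outcome can be signalled to the destination with two bits per forwarded block.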

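To define the system at the receiver, let us denote with $\mathbf{f}$ the vector of the channel gains for the links between the relays and the destination. The received signal at the destination over $N_R$ consecutive slots will be the vector:

$$\mathbf{y} = \begin{bmatrix} f_1g_1q_1 & f_2g_2q_2 & \cdots \end{bmatrix}^T + \begin{bmatrix} w(1) & w(2) & \cdots \end{bmatrix}^T$$

In the above $w(r)$ are the AWGN samples during the specific slot. Elaborating on the above leads to:

$$\mathbf{y} = \begin{bmatrix} g_1f_1a_{1,1} & g_1f_1a_{1,2} \\ g_2f_2a_{2,1} & g_2f_2a_{2,2} \\ \vdots & \vdots \end{bmatrix} \begin{bmatrix} x_1 \\ x_2 \end{bmatrix} + \begin{bmatrix} g_1f_1a_{1,3}w_1 + w(1) \\ g_2f_2a_{2,3}w_2 + w(2) \\ \vdots \end{bmatrix} = \mathbf{H}\mathbf{x} + \mathbf{w} \tag{3}$$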


The destination still receives $N_R$ observations over $N_R$ slots that it can optimally solve with our linear MIMO MMSE-SIC decoder [26] described next.

For decoding based on the signal model in (3) we calculate the covariance matrix $\mathbf{\Sigma}_w$ of the noise vector. The entries of this $N_R \times N_R$ matrix are $[\mathbf{\Sigma}_w]_{r,r'} = 0, \forall r \neq r'$ and $[\mathbf{\Sigma}_w]_{r,r} = g_r^2|f_r|^2|a_{r,3}|^2\sigma^2 + \sigma^2$. For final decoding of the transmitted symbols we apply linear MMSE equalization for the signal model in (3) as follows:

$$\hat{\mathbf{x}} = \mathrm{HDD}\{(\mathbf{H}^H\mathbf{\Sigma}_w^{-1}\mathbf{H} + \mathbf{I})^{-1}\mathbf{H}^H\mathbf{\Sigma}_w^{-1}\mathbf{y}\} \tag{4}$$

In the above HDD stands for hard decision decoding.² To obtain the receiver SNR for S1 under MMSE decoding, we only need the first column of $\mathbf{H}$:

$$\gamma_D = \mathbf{H}_{*,1}^H\mathbf{\Sigma}_w^{-1}\mathbf{H}_{*,1} \tag{5}$$

By defining $\gamma = 1/\sigma^2$ and elaborating on the above, the instantaneous SNR becomes:

$$\gamma_D = \sum_{r=1}^{N_R} \frac{g_r^2|f_r|^2|a_{r,1}|^2}{g_r^2|f_r|^2|a_{r,3}|^2 + 1}\,\gamma \tag{6}$$
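The destination processing of equations (3)-(6) can be sketched with plain Python complex arithmetic. The code below is a hedged illustration for the two-stream case: the helper names (`noise_variances`, `snr_s1`, `mmse_estimate`), the list-based matrix representation, and the example values are assumptions made here, and the hard decision step (HDD) is left out so that only the soft MMSE estimate is returned.

```python
def noise_variances(g, f, a3, sigma2):
    """Diagonal of the noise covariance: g_r^2 |f_r|^2 |a_r3|^2 sigma^2 + sigma^2."""
    return [sigma2 * (gr ** 2 * abs(fr) ** 2 * abs(a) ** 2 + 1)
            for gr, fr, a in zip(g, f, a3)]


def snr_s1(g, f, a1, a3, sigma2):
    """Instantaneous SNR of S1 at the destination, equation (6)."""
    gamma = 1.0 / sigma2
    return gamma * sum(
        gr ** 2 * abs(fr) ** 2 * abs(x) ** 2
        / (gr ** 2 * abs(fr) ** 2 * abs(z) ** 2 + 1)
        for gr, fr, x, z in zip(g, f, a1, a3))


def mmse_estimate(H, noise_var, y):
    """Soft estimate (H^H S^-1 H + I)^-1 H^H S^-1 y for two streams, before HDD."""
    A = [[1 + 0j, 0j], [0j, 1 + 0j]]          # starts as the identity matrix I
    b = [0j, 0j]
    for row, s, yr in zip(H, noise_var, y):   # one row per relay slot
        for i in range(2):
            b[i] += complex(row[i]).conjugate() * yr / s
            for j in range(2):
                A[i][j] += complex(row[i]).conjugate() * row[j] / s
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    x1 = (b[0] * A[1][1] - A[0][1] * b[1]) / det   # Cramer's rule for the 2x2 system
    x2 = (A[0][0] * b[1] - A[1][0] * b[0]) / det
    return x1, x2


# Hypothetical two-relay example: relay 1 decoded both blocks, relay 2 decoded x1 only.
g = [1.0, 1.0]
f = [1 + 0j, 0.5 - 0.5j]
a = [(1, 1, 0), (1, 0.5 + 0.2j, 1)]
H = [[gr * fr * ar[0], gr * fr * ar[1]] for gr, fr, ar in zip(g, f, a)]
print(snr_s1(g, f, [ar[0] for ar in a], [ar[2] for ar in a], sigma2=0.5))
```

The SNR returned by `snr_s1` coincides with the quadratic form in (5) because the noise covariance is diagonal.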

From (6) we note that decoding at D is conditioned on what has been decoded at the relays. This creates the requirement that the relay must indicate in the preamble of each packet the local SIC decoding results. Since the relays manipulate information at the block level, the SIC results are communicated with the transmitted block on the second hop. Thus, the overhead of indicating the above four local decoding results at each relay for each block they forward is negligible (2 bits for every block). The CSI requirement is tackled as we described earlier.

²The adoption of HDD intends to lower the execution time of decoding at the relays and the destination and to accelerate simulation. Joint demodulation and decoding of the channel code is expected to reduce bit errors, but this aspect is not related to our study and is well-investigated and characterized in the literature.


III. Outage Analysis 

In this section we calculate the outage probability of the proposed protocol. We only study the performance of S1 since the same result applies to S2. To minimize the complexity of our derived expressions, we define the following random variables along with their expectations: $Y=|h_{1,1}|^2$, $X=|h_{2,1}|^2$, $E[|h_{1,1}|^2]=1/\mu_1$, $E[|h_{2,1}|^2]=1/\lambda_1$, and the same for every relay $r$. For the second hop we only need the average channel gains $E[|f_r|^2]$, as we will see. Thus, we only manipulate exponential random variables in the remaining parts of this section. Also, in this analysis we consider error propagation in SIC, which is critical for its proper performance modeling. Considering complex Gaussian codebooks, for a channel code at $R$ bits/symbol/Hz, information is lost when the instantaneous channel capacity is lower than $R$, leading to the outage probability $P_{out}=\Pr\{I(|h|^2\,SNR) < R\}$, where $I(x)=\log_2(1 + x)$ is the Gaussian channel capacity and $|h|$ is the fading amplitude, which is the same for the entire block.
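As a quick sanity check of this outage definition, the interference-free point-to-point case can be evaluated both in closed form and by simulation. The sketch below relies on the standard result that, for an exponentially distributed gain $|h|^2$ with mean $\bar{g}$, $\Pr\{\log_2(1+|h|^2 SNR) < R\} = 1-\exp(-(2^R-1)/(SNR\,\bar{g}))$. The function names and the parameter values are assumptions made for illustration.

```python
import math
import random


def outage_closed_form(rate, snr, mean_gain=1.0):
    """Pr{log2(1 + |h|^2 * snr) < rate} for an exponential |h|^2 with the given mean."""
    return 1.0 - math.exp(-(2 ** rate - 1) / (snr * mean_gain))


def outage_monte_carlo(rate, snr, mean_gain=1.0, trials=200_000, seed=1):
    """Empirical outage frequency over independent block-fading realizations."""
    rng = random.Random(seed)
    fails = sum(
        math.log2(1 + rng.expovariate(1.0 / mean_gain) * snr) < rate
        for _ in range(trials))
    return fails / trials


print(outage_closed_form(1.0, 10.0), outage_monte_carlo(1.0, 10.0))
```

The two numbers should agree up to Monte Carlo error, which is the same kind of validation used later for the joint SIC events.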

Outage Probability of Joint Events. With our protocol, the outage probability of the transmissions from S1 has to be calculated for all the decoding events at the $N_R$ relays. This is necessary since our protocol adapts the transmitted signals on the second hop depending on what is decoded at the relays. The end-to-end outage probability expression has to consider $4^{N_R}$ potential decoding events. However, we notice that because the random complex channel gains between the sources and the relays are independent, the SIC decoding events at the relays are also independent. To capture the above formally, let $I_{ir}$ be the instantaneous mutual information between the signal transmitted from source $i$ and the signal received at relay $r$. This allows us to write for the probability of the representative event that no block is decoded at any relay (written here for $N_R=2$, i.e., $N_P=3$):

$$\Pr\{I_{11} < \tfrac{3}{2}R_1, I_{21} < \tfrac{3}{2}R_2, I_{12} < \tfrac{3}{2}R_1, I_{22} < \tfrac{3}{2}R_2\} = \prod_{r=1}^{2}\Pr\{I_{1r} < \tfrac{3}{2}R_1, I_{2r} < \tfrac{3}{2}R_2\} \tag{7}$$

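One can similarly write the probability of all the other $4^{N_R}-1$ events. The end-to-end outage probability expression that is provided next is only partially presented due to its several terms:

$$\begin{aligned} P_{out,S1} ={} & \prod_{r=1}^{N_R}\Pr\{I_{1r} < \tfrac{N_P}{2}R_1, I_{2r} < \tfrac{N_P}{2}R_2\} \times \Pr\{\log_2(1 + \gamma_D(\mathbf{a})) < \tfrac{N_P}{2}R_1\} \\ & + \Pr\{I_{11} > \tfrac{N_P}{2}R_1, I_{21} < \tfrac{N_P}{2}R_2\} \times \prod_{r=2}^{N_R}\Pr\{I_{1r} < \tfrac{N_P}{2}R_1, I_{2r} < \tfrac{N_P}{2}R_2\} \times \Pr\{\log_2(1 + \gamma_D(\mathbf{a})) < \tfrac{N_P}{2}R_1\} + \cdots \end{aligned} \tag{8}$$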

This expression takes into account the probability of a specific decoding result at all the relays, and this is multiplied with the probability that S1 is in outage during the transmission in the second hop. This multiplication is possible since the events on the first and second hops are independent. Since we have two sources and $N_R$ relays, there are $4^{N_R}$ potential decoding outcomes for the two transmitted blocks. Note that the fraction $\frac{N_P}{2}$ in front of every $R_1, R_2$ is because our protocol transmits 2 blocks over $N_P = 1 + N_R$ time slots.

Fig. 2. Numerical outage results for $\Pr\{I_{11} < \frac{3}{2}R_1, I_{21} < \frac{3}{2}R_2\}$. (a) $R_1 = R_2 = 1$ bit/symbol. (b) $R_1 = 1$, $R_2 = 2$ bits/symbol.
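The bookkeeping over the $4^{N_R}$ joint decoding events can be sketched as follows. This is only an illustration of the independence argument behind (7): the outcome labels, the function name `joint_event_probabilities`, and the per-relay probabilities in the example are hypothetical placeholders, not values from the paper.

```python
import itertools
import math

# Four possible SIC outcomes at each relay (labels are illustrative).
OUTCOMES = ("both_out", "s1_out", "s2_out", "none_out")


def joint_event_probabilities(per_relay):
    """Probability of every joint first-hop event.

    per_relay: list with one dict per relay, mapping each outcome to its
    probability at that relay. Independence across relays, as in (7),
    lets each joint probability be written as a product.
    """
    events = {}
    for combo in itertools.product(OUTCOMES, repeat=len(per_relay)):
        events[combo] = math.prod(p[o] for p, o in zip(per_relay, combo))
    return events


# Hypothetical per-relay probabilities for N_R = 2 relays.
relay = {"both_out": 0.1, "s1_out": 0.2, "s2_out": 0.3, "none_out": 0.4}
events = joint_event_probabilities([relay, relay])
print(len(events), sum(events.values()))
```

In the end-to-end expression each of these joint probabilities is then multiplied by the corresponding second-hop outage probability.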

Another consequence of the result in (7) is that the outage expressions for the first relay can be directly re-used for every relay. The only difference in this case will be the different average channel gains (i.e., $\lambda_r$, $\mu_r$). The conclusion from this discussion is that we only need to calculate the probability of four events at a single relay.

The first event is that both users are simultaneously in outage at the first relay:

$$P_{out,S1S2@RS1} = \Pr\{I_{11} < \tfrac{N_P}{2}R_1, I_{21} < \tfrac{N_P}{2}R_2\} \tag{9}$$

Now the important detail is that if we apply SIC at the relay, $I_{11}$ will be different depending on which signal has the highest energy/bit, since this will be the one that is decoded first. In particular, if the signal from S1 has higher energy/bit than the signal from S2, i.e., if $\{Y > \frac{k_1}{k_2}X\}$ (this is condition (1)), it will be $I_{11}=\log_2(1 + \frac{Y}{X+\sigma^2})=\log_2(1 + \frac{\gamma Y}{\gamma X+1})$, where $\gamma=1/\sigma^2$ is the transmit SNR. Here, we have defined $k_1=2^{\frac{N_P}{2}R_1} - 1$ and $k_2=2^{\frac{N_P}{2}R_2} - 1$ as the SNR block decoding thresholds, again in our effort to minimize notation. Now in the opposite case, where the signal from S2 has higher energy/bit than S1, SIC will first decode source S2, which means that it can remove it from the aggregate signal. Thus, $I_{11}=\log_2(1 + \gamma Y)$. Hence, the event in (9) can be decomposed into two mutually exclusive events depending on which signal is decoded first:

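$$\begin{aligned} \Pr\{I_{11} < \tfrac{N_P}{2}R_1, I_{21} < \tfrac{N_P}{2}R_2\} ={} & \Pr\{I_{11} < \tfrac{N_P}{2}R_1, I_{21} < \tfrac{N_P}{2}R_2, Y > \tfrac{k_1}{k_2}X\} \\ & + \Pr\{I_{11} < \tfrac{N_P}{2}R_1, I_{21} < \tfrac{N_P}{2}R_2, Y < \tfrac{k_1}{k_2}X\} \end{aligned} \tag{10}$$

From our previous discussion, (10) can be written as:

$$\Pr\{I_{11} < \tfrac{N_P}{2}R_1, I_{21} < \tfrac{N_P}{2}R_2\} = \Pr\{Y - k_1X < \tfrac{k_1}{\gamma}, Y > \tfrac{k_1}{k_2}X\} + \Pr\{X - k_2Y < \tfrac{k_2}{\gamma}, Y < \tfrac{k_1}{k_2}X\} \tag{11}$$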
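The step from the mutual-information events in (10) to the linear events in (11) follows directly from the definitions of $k_1$ and $\gamma$. As a short worked derivation for the case where S1 is decoded first (so that the block from S2 acts as noise):

```latex
\begin{align*}
I_{11} < \tfrac{N_P}{2}R_1
  &\iff \log_2\!\Big(1+\frac{\gamma Y}{\gamma X+1}\Big) < \tfrac{N_P}{2}R_1 \\
  &\iff \frac{\gamma Y}{\gamma X+1} < 2^{\frac{N_P}{2}R_1}-1 = k_1 \\
  &\iff Y - k_1 X < \frac{k_1}{\gamma}.
\end{align*}
```

The same steps with the roles of $X$ and $Y$ exchanged give $X - k_2Y < \frac{k_2}{\gamma}$ for the case where S2 is decoded first.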

The event $\{Y > \frac{k_1}{k_2}X\}$ considers all the cases where the block $x_1$, which originates from S1, has the highest energy/bit and thus will be decoded first. If this last event is true, then $\{Y - k_1X < \frac{k_1}{\gamma}\}$ is the event that S1 cannot be decoded (event $\{I_{11} < \frac{N_P}{2}R_1\}$). Note that this joint event includes the case that S2 cannot be decoded if $\{Y > \frac{k_1}{k_2}X\}$ is true. The reason is simply that the energy/bit is lower for S2, and so if S1 cannot be decoded we cannot decode source S2. On the other hand, when $\{Y < \frac{k_1}{k_2}X\}$ is true in the second of the two mutually exclusive events in (11), then similarly to before we only need to consider the probability that S2 will be in outage, since in this case S1 will definitely be in outage.

To calculate (11) recall that $X$ and $Y$ are independent exponential random variables. This means that their joint probability density function (PDF) is separable. Thus, for the first event in (11):

$$\Pr\{Y - k_1X < \tfrac{k_1}{\gamma}, Y > \tfrac{k_1}{k_2}X\} = \int_0^{\infty}\int_{\frac{k_1}{k_2}x}^{k_1x+\frac{k_1}{\gamma}} f_X(x)f_Y(y)\,dy\,dx = \frac{\lambda_1}{\lambda_1 + \mu_1\frac{k_1}{k_2}} - \frac{\lambda_1\exp(-\frac{\mu_1k_1}{\gamma})}{\lambda_1 + \mu_1k_1} \tag{12}$$

Similarly we calculate the probability of the second event in (11), and then by adding the two results we finally obtain:

$$P_{out,S1S2@RS1} = \Pr\{I_{11} < \tfrac{N_P}{2}R_1, I_{21} < \tfrac{N_P}{2}R_2\} = \frac{\lambda_1}{\lambda_1 + \mu_1\frac{k_1}{k_2}} - \frac{\lambda_1\exp(-\frac{\mu_1k_1}{\gamma})}{\lambda_1 + \mu_1k_1} + \frac{\mu_1}{\mu_1 + \lambda_1\frac{k_2}{k_1}} - \frac{\mu_1\exp(-\frac{\lambda_1k_2}{\gamma})}{\mu_1 + \lambda_1k_2} \tag{13}$$

Before we proceed, it is instructive to understand an extreme case where $\lambda_1 \to \infty$. This corresponds to a very small value for the average channel gain from S2 to RS1, i.e., $E[|h_{2,1}|^2] \to 0$. This leads to $P_{out,S1S2@RS1} \to 1 - \exp(-\frac{\mu_1k_1}{\gamma})$, as expected. By setting the other extreme value $\lambda_1 \to 0$, or $E[|h_{2,1}|^2] \to \infty$, the outage probability becomes zero. Now for different rate requirements we can see from (20) that the outage event does not converge to zero in the high SNR regime.

With a similar methodology we calculate the probability of the other three events at the first relay. In particular, the second event considers the case that S1 is in outage while S2 is not:

$$\begin{aligned} P_{out,S1@RS1} &= \Pr\{I_{11} < \tfrac{N_P}{2}R_1, I_{21} > \tfrac{N_P}{2}R_2\} \\ &= \Pr\{I_{11} < \tfrac{N_P}{2}R_1, I_{21} > \tfrac{N_P}{2}R_2, Y > \tfrac{k_1}{k_2}X\} + \Pr\{I_{11} < \tfrac{N_P}{2}R_1, I_{21} > \tfrac{N_P}{2}R_2, Y < \tfrac{k_1}{k_2}X\} \end{aligned}$$

In the last decomposed expression we followed the same procedure as before. Only in this case the probability of the first of the two disjoint events is zero. This is again a result of the behavior of SIC, which selects to decode first the symbol with the highest energy/bit: if the signal from S1 is the strongest (i.e., $\{Y > \frac{k_1}{k_2}X\}$), and the relay fails to decode it (i.e., $\{I_{11} < \frac{N_P}{2}R_1\}$), then we cannot decode S2 since the signal from S1 was not cancelled. This leads to:

$$\begin{aligned} P_{out,S1@RS1} &= \Pr\{I_{11} < \tfrac{N_P}{2}R_1, I_{21} > \tfrac{N_P}{2}R_2, Y < \tfrac{k_1}{k_2}X\} \\ &= \Pr\{Y < \tfrac{k_1}{\gamma}, X - k_2Y > \tfrac{k_2}{\gamma}, Y < \tfrac{k_1}{k_2}X\} \\ &= \frac{\mu_1\exp(-\frac{\lambda_1k_2}{\gamma})}{\mu_1 + \lambda_1k_2}\left(1 - \exp\left(-\frac{(\mu_1 + \lambda_1k_2)k_1}{\gamma}\right)\right) \end{aligned} \tag{14}$$

In the above result note that when the decoding of S2 succeeds, the signal $x_2$ will be removed from the aggregate. This means that the event that S1 is not decoded is $\{Y < \frac{k_1}{\gamma}\}$, since the decoder must only combat the noise after cancellation. Consider again the case $\lambda_1 \to \infty$. This corresponds to a very small value for the average channel gain, i.e., $E[|h_{2,1}|^2] \to 0$, and so the power level of the received signal from S2 is very low. But one notes that we now consider the event $\{Y < \frac{k_1}{k_2}X\}$, i.e., the event that the signal from S1 will be received at an even lower power level than S2. Thus, the probability of the event in (14) becomes zero simply because this event occurs infrequently as $\lambda_1 \to \infty$.
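The closed forms (13) and (14) can be cross-checked against a direct simulation of the ordered SIC rule. The sketch below is an illustration under stated assumptions: the function names are introduced here, the parameter values are hypothetical, and the example uses thresholds $k_1, k_2 \geq 1$ (e.g., $R_1=R_2=1$ with $N_P=3$), for which the integration limits used in the derivation are valid over the whole range of $x$.

```python
import math
import random


def p_both_out(lam, mu, k1, k2, gamma):
    """Equation (13): neither S1 nor S2 is decoded at the relay."""
    return (lam / (lam + mu * k1 / k2)
            - lam * math.exp(-mu * k1 / gamma) / (lam + mu * k1)
            + mu / (mu + lam * k2 / k1)
            - mu * math.exp(-lam * k2 / gamma) / (mu + lam * k2))


def p_s1_out(lam, mu, k1, k2, gamma):
    """Equation (14): S2 is decoded first, S1 fails after cancellation."""
    return (mu * math.exp(-lam * k2 / gamma) / (mu + lam * k2)
            * (1 - math.exp(-(mu + lam * k2) * k1 / gamma)))


def simulate(lam, mu, k1, k2, gamma, trials=200_000, seed=7):
    """Monte Carlo estimate of both events using the ordered SIC rule."""
    rng = random.Random(seed)
    both = s1_only = 0
    for _ in range(trials):
        x = rng.expovariate(lam)            # X = |h_21|^2, mean 1/lam
        y = rng.expovariate(mu)             # Y = |h_11|^2, mean 1/mu
        if y > k1 / k2 * x:                 # S1 is decoded first
            s1_ok = y - k1 * x >= k1 / gamma
            s2_ok = s1_ok and x >= k2 / gamma
        else:                               # S2 is decoded first
            s2_ok = x - k2 * y >= k2 / gamma
            s1_ok = s2_ok and y >= k1 / gamma
        both += (not s1_ok) and (not s2_ok)
        s1_only += (not s1_ok) and s2_ok
    return both / trials, s1_only / trials


k = 2 ** 1.5 - 1                            # R = 1 bit/symbol, N_P = 3
print(p_both_out(1.0, 1.0, k, k, 10.0), p_s1_out(1.0, 1.0, k, k, 10.0))
print(simulate(1.0, 1.0, k, k, 10.0))
```

The simulation makes the error-propagation assumption explicit: if the first block in the SIC order fails, the second one is never attempted.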


The third event is symmetric to what we just analyzed: the second source S2 is in outage while the first source S1 is successfully decoded. This is expressed as:

$$\begin{aligned} P_{out,S2@RS1} &= \Pr\{I_{11} > \tfrac{N_P}{2}R_1, I_{21} < \tfrac{N_P}{2}R_2\} \\ &= \Pr\{Y - k_1X > \tfrac{k_1}{\gamma}, X < \tfrac{k_2}{\gamma}, Y > \tfrac{k_1}{k_2}X\} + \Pr\{I_{11} > \tfrac{N_P}{2}R_1, I_{21} < \tfrac{N_P}{2}R_2, Y < \tfrac{k_1}{k_2}X\} \end{aligned} \tag{15}$$

With the reasoning we followed in the previous paragraphs we can easily see that the probability of the second of the two events above is zero. Thus, the previous becomes:

$$\Pr\{I_{11} > \tfrac{N_P}{2}R_1, I_{21} < \tfrac{N_P}{2}R_2\} = \Pr\{Y - k_1X > \tfrac{k_1}{\gamma}, X < \tfrac{k_2}{\gamma}, Y > \tfrac{k_1}{k_2}X\} \tag{16}$$

The integration is slightly more complicated in this case, and so we calculate (16) as:

$$\Pr\{Y - k_1X > \tfrac{k_1}{\gamma}, X < \tfrac{k_2}{\gamma}, Y > \tfrac{k_1}{k_2}X\} = \Pr\{X < \tfrac{k_2}{\gamma}, Y > \tfrac{k_1}{k_2}X\} - \Pr\{Y - k_1X < \tfrac{k_1}{\gamma}, X < \tfrac{k_2}{\gamma}, Y > \tfrac{k_1}{k_2}X\} \tag{17}$$

The calculation gives:

$$\begin{aligned} P_{out,S2@RS1} &= \Pr\{Y - k_1X > \tfrac{k_1}{\gamma}, X < \tfrac{k_2}{\gamma}, Y > \tfrac{k_1}{k_2}X\} \\ &= \int_0^{\frac{k_2}{\gamma}}\int_{\frac{k_1}{k_2}x}^{\infty} f_X(x)f_Y(y)\,dy\,dx - \int_0^{\frac{k_2}{\gamma}}\int_{\frac{k_1}{k_2}x}^{k_1x+\frac{k_1}{\gamma}} f_X(x)f_Y(y)\,dy\,dx \\ &= \frac{\lambda_1\exp(-\frac{\mu_1k_1}{\gamma})}{\lambda_1 + \mu_1k_1}\left(1 - \exp\left(-\frac{(\lambda_1 + \mu_1k_1)k_2}{\gamma}\right)\right) \end{aligned} \tag{18}$$

Even though this event is symmetric to what we analyzed in the previous paragraph, the results and their interpretation are not. To understand the performance consider again the first case where $\lambda_1 \to \infty$. This corresponds to a very low value for the average channel gain, i.e., $E[|h_{2,1}|^2] \to 0$. This leads to $P_{out,S2@RS1} \to \exp(-\frac{\mu_1k_1}{\gamma})$. Note that this is essentially the case that the received signal from S2 has negligible power and it practically does not interfere. Hence, this means that the outage behavior of S1 approaches the well-known behavior of the point-to-point channel (recall that the outage probability of the event $\{I_{11} < \frac{N_P}{2}R_1\}$ in this case is $1 - \exp(-\frac{\mu_1k_1}{\gamma})$). Therefore, the probability of the opposite event, i.e., $\{I_{11} > \frac{N_P}{2}R_1\}$, is $\exp(-\frac{\mu_1k_1}{\gamma})$. Care must be given to the high SNR approximation in this case. The high SNR approximation for (18) with $\lambda_1 \to \infty$ leads to $\lim_{\gamma\to\infty}P_{out,S2@RS1} = 1$. This is easy to understand because this case corresponds to S2 not being decoded at RS1, which will be true when $\lambda_1 \to \infty$ (low inter-user interference). However, the high SNR approximation for (18) with bounded $\lambda_1$ and $\gamma \to \infty$ leads to $\lim_{\gamma\to\infty}P_{out,S2@RS1} = 0$.

Fig. 3. Numerical outage results of the joint events. (a) $\Pr(I_{11} < R_1, I_{21} > R_2)$. (b) $\Pr(I_{11} > R_1, I_{21} < R_2)$.

Finally, we consider the event that both sources are not in outage at the first relay. Since all four events that we described in the last few paragraphs are mutually exclusive, and their probabilities sum up to one, we can easily derive this last quantity. It is thus:

$$\begin{aligned} P_{out,none@RS1} &= \Pr\{I_{11} > \tfrac{N_P}{2}R_1, I_{21} > \tfrac{N_P}{2}R_2\} \\ &= \Pr\{Y - k_1X > \tfrac{k_1}{\gamma}, X > \tfrac{k_2}{\gamma}, Y > \tfrac{k_1}{k_2}X\} + \Pr\{Y > \tfrac{k_1}{\gamma}, X - k_2Y > \tfrac{k_2}{\gamma}, Y < \tfrac{k_1}{k_2}X\} \end{aligned}$$

And this becomes:

$$P_{out,none@RS1} = 1 + \frac{\lambda_1\exp(-\frac{\mu_1k_1}{\gamma})}{\lambda_1 + \mu_1k_1}\exp\left(-\frac{(\lambda_1 + \mu_1k_1)k_2}{\gamma}\right) + \frac{\mu_1\exp(-\frac{\lambda_1k_2}{\gamma})}{\mu_1 + \lambda_1k_2}\exp\left(-\frac{(\mu_1 + \lambda_1k_2)k_1}{\gamma}\right) - \frac{\lambda_1}{\lambda_1 + \mu_1\frac{k_1}{k_2}} - \frac{\mu_1}{\mu_1 + \lambda_1\frac{k_2}{k_1}} \tag{19}$$

As we explained initially in this section, the outage expressions for the $r$-th relay are exactly the same. The only difference in this case is the average channel gains.

End-to-End Outage Probability. Now we get back to the main outage expression in (8), which includes the outage probability expressions for the second hop. Depending on SIC, different signals are transmitted. For each one of the $4^{N_R}$ decoding events, we set all the different values for $\mathbf{a}$ in the parameterized SNR expression of $\gamma_D$ given in (6), and then we include the results in the final outage expression in (8). It is challenging to derive closed-form expressions for the second hop, except of course the trivial case where both blocks are decoded at every relay. The reason is that $\gamma_D$ in (6) is a fractional expression with exponential random variables in the numerator and the denominator. Nevertheless, we can calculate numerically the outage probability for the second hop, and this is what we do when we present the numerical results, similar to the related work [23].
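One possible way to carry out the numerical evaluation of the second-hop term is a Monte Carlo average over the relay-destination gains, using the SNR expression (6). The sketch below is an assumption-laden illustration and not the authors' procedure: the function names are introduced here, the scaling factors `g` and the coefficient magnitudes `a1_abs2`, `a3_abs2` are supplied by the caller and held fixed, and `f_rates` are the hypothetical rate parameters of the exponential gains $|f_r|^2$.

```python
import math
import random


def gamma_d(gamma, g, f_abs2, a1_abs2, a3_abs2):
    """SNR of S1 at the destination, equation (6), for given squared magnitudes."""
    return gamma * sum(
        gr ** 2 * fr * a1 / (gr ** 2 * fr * a3 + 1)
        for gr, fr, a1, a3 in zip(g, f_abs2, a1_abs2, a3_abs2))


def second_hop_outage(gamma, g, a1_abs2, a3_abs2, f_rates, r1, n_p,
                      trials=100_000, seed=0):
    """Estimate Pr{log2(1 + gamma_D) < (N_P / 2) * R_1} over random |f_r|^2."""
    rng = random.Random(seed)
    threshold = 0.5 * n_p * r1
    fails = 0
    for _ in range(trials):
        f_abs2 = [rng.expovariate(rate) for rate in f_rates]
        if math.log2(1 + gamma_d(gamma, g, f_abs2, a1_abs2, a3_abs2)) < threshold:
            fails += 1
    return fails / trials


# Hypothetical case: two relays that both decoded x1 and x2 (a_r1 = 1, a_r3 = 0).
print(second_hop_outage(10.0, [1.0, 1.0], [1.0, 1.0], [0.0, 0.0],
                        [1.0, 1.0], r1=1.0, n_p=3))
```

For a complete evaluation of (8), this estimate would be repeated for every joint first-hop event, with the coefficients taken from Table I for that event.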


IV. DMT Analysis 


Two-hop single-user cooperative protocols that transmit a single block can achieve a multiplexing gain of 1/2 BOl. Our communication scheme uses $N_P=N_R+1$ slots for transmitting two blocks. So this leads potentially to a maximum multiplexing gain of 2/3 for $N_R=2$. To get the DMT expression, and verify the previous result, we have to consider all the $4^{N_R}$ product terms in (8). In this case we observe that every term in the summation of $P_{out,S1}$ will be the product of $N_R+1$ probability terms (for example $\prod_{r=1}^{N_R}P_{out,S1S2@RSr}P_{out,S1@D}$).

First hop. We calculate first the individual high SNR approximations for the outage probabilities of the first hop. The high SNR approximation for (13) leads to a constant, i.e., $\lim_{\gamma\to\infty}P_{out,S1S2@RS1}=C_1$, with

$$C_1 = \frac{\lambda_1}{\lambda_1 + \mu_1\frac{k_1}{k_2}} - \frac{\lambda_1}{\lambda_1 + \mu_1k_1} + \frac{\mu_1}{\mu_1 + \lambda_1\frac{k_2}{k_1}} - \frac{\mu_1}{\mu_1 + \lambda_1k_2} \tag{20}$$

Similarly we obtain for RSr that $\lim_{\gamma\to\infty}P_{out,S1S2@RSr}=C_r$. The intermediate cases fortunately lead to zero terms. For example, the high SNR approximation for (14) leads to $\lim_{\gamma\to\infty}P_{out,S1@RS1}=0$. Similarly it can be verified that $\lim_{\gamma\to\infty}P_{out,S2@RS1}=0$, and of course the same expressions are zero for any other relay RSr. The high SNR approximation of (19) leads to another constant $\lim_{\gamma\to\infty}P_{out,none@RS1}=C'_1$, where

$$C'_1 = 1 + \frac{\lambda_1}{\lambda_1 + \mu_1k_1} + \frac{\mu_1}{\mu_1 + \lambda_1k_2} - \frac{\lambda_1}{\lambda_1 + \mu_1\frac{k_1}{k_2}} - \frac{\mu_1}{\mu_1 + \lambda_1\frac{k_2}{k_1}} \tag{21}$$

Similarly we obtain for RSr that $\lim_{\gamma\to\infty}P_{out,none@RSr}=C'_r$. The above analysis means that in the high SNR regime we only have to consider $2^{N_R}$ terms in (8), i.e., the cases that either both or none of the signals are decoded at a relay.
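The high SNR behavior of the four first-hop events can be checked numerically from the closed forms (13), (14), (18), and (19). The following sketch is only illustrative: the function names, the dictionary keys, and the example parameters ($\lambda_1=1$, $\mu_1=2$, $R_1=1$, $R_2=2$, $N_P=3$) are assumptions made here.

```python
import math


def first_hop_events(lam, mu, k1, k2, gamma):
    """Closed forms (13), (14), (18), (19) for one relay, returned as a dict."""
    a = lam * math.exp(-mu * k1 / gamma) / (lam + mu * k1)
    b = mu * math.exp(-lam * k2 / gamma) / (mu + lam * k2)
    c_a = lam / (lam + mu * k1 / k2)
    c_b = mu / (mu + lam * k2 / k1)
    e_a = math.exp(-(lam + mu * k1) * k2 / gamma)
    e_b = math.exp(-(mu + lam * k2) * k1 / gamma)
    return {
        "both_out": c_a - a + c_b - b,                   # (13)
        "s1_out": b * (1 - e_b),                         # (14)
        "s2_out": a * (1 - e_a),                         # (18)
        "none_out": 1 + a * e_a + b * e_b - c_a - c_b,   # (19)
    }


def high_snr_constants(lam, mu, k1, k2):
    """Constants C_1 of (20) and C'_1 of (21)."""
    c1 = (lam / (lam + mu * k1 / k2) - lam / (lam + mu * k1)
          + mu / (mu + lam * k2 / k1) - mu / (mu + lam * k2))
    c1_prime = (1 + lam / (lam + mu * k1) + mu / (mu + lam * k2)
                - lam / (lam + mu * k1 / k2) - mu / (mu + lam * k2 / k1))
    return c1, c1_prime


k1, k2 = 2 ** 1.5 - 1, 2 ** 3 - 1          # R_1 = 1, R_2 = 2, N_P = 3
print(first_hop_events(1.0, 2.0, k1, k2, gamma=1e9))
print(high_snr_constants(1.0, 2.0, k1, k2))
```

At very large $\gamma$ the two intermediate events vanish and the remaining two approach $C_1$ and $C'_1$, which is why only $2^{N_R}$ terms survive in the high SNR regime.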

Second hop. The best case is that both signals from S1, S2 are decoded at every relay. From (6) we have that for S1:




Id = 


Nr 

, lm^Poutsi@D = 


Np 

2 ~Mi — ^. atr 


( 22 ) 


From (21) and (22) we have that the approximation for this
first event is:


lim Pouts7 


Nr 

PoUt,ione@RSiPoUtsi@D 

r=l 


Nr 

( n ( 


2 —«i _ 1 
7 

(23) 


The other events are covered by the worst case, in which no block
is decoded at any relay:

$$\text{Pout}_{S_1}^{\text{event 2}} = \Big(\prod_{r=1}^{N_R} \text{Pout}_{S_1S_2@RS_r}\Big)\,\text{Pout}_{S_1@D} \tag{24}$$

The first terms converge to the constants $C_r$, so we only need
to approximate the final term in (24). To do that, and to
simplify the final result, we create the equivalent channel
model for Q; We denote the first column vector of H as 
= [hi^dfi h 2 ,df 2 ■■■ hr^frY■ the instantaneous 
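channel matrix is $H$, the mutual information between the
transmitted signal $x_1$ and the sufficient statistic $\mathbf{y}$ at the
receiver is:

$$I(x_1;\mathbf{y}\mid\mathbf{H}=H) = I(x_1;\mathbf{H}\mathbf{x}+\mathbf{w}\mid\mathbf{H}=H) \overset{(a)}{=} I(x_1;\mathbf{H}_{*,1}x_1+\mathbf{w}\mid\mathbf{H}=H) \tag{25}$$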

In the above, the last equality in (25) holds because the sufficient statistic for $x_1$
requires only the first column of $\mathbf{H}$. The mutual information
of the $1 \times N_R$ MIMO channel $\mathbf{H}_{*,1} x_1 + \mathbf{w}$ is:

logdet(Ijv,(26) 

The mutual information can help calculate the behavior of the 
lower bound on the outage probability. Namely: 


^ Pr{logdet(I^, + 

’ z 


PoutsfgD = Pr{logdet(lArK 

(S) 


(27) 


r—1 


Pr{log (l + 7^5rl/rPlVdP) 

Nk 

Kn 


- 


lim Pout; 

7 —>-oo 


LB 

SK^D 




jVp p, 

2—r? 1 — 2 ^ ^ a^r 


7 


(28) 

In the above (l27l l is because the colored noise does not affect 
the DMT analysis, that is approximated with white noise 
(§9, ll26ll f. Also (1281) is because the resulting random variable 
that is the product of two exponential random variables is 
independent across the N-p paths i). For the lower bound 
event 2 leads to 


$$\lim_{\gamma\to\infty} \text{Pout}^{LB}_{S_1@D} = \left(\frac{2^{\frac{N_P}{2}R_1}-1}{\gamma}\right)^{N_R}$$

Finally, the upper bound $\text{Pout}^{UB}_{S_1@D}$ is that of (22), since it is
impossible to have a better behavior for the protocol from the
relays to D. So for the complete protocol:

$$\lim_{\gamma\to\infty} \text{Pout}_{S_1} = \left(\frac{2^{\frac{N_P}{2}R_1}-1}{\gamma}\right)^{N_R}$$

Final Result. Given that the multiplexing gains and the data
rates of S1 and S2 are related by $R_1 = r_1\log(\gamma)$, $R_2 = r_2\log(\gamma)$,
the DMT expression is:

$$d = \lim_{\gamma\to\infty} \frac{-\log \text{Pout}_{S_1}}{\log\gamma} = N_R - \frac{N_R N_P}{2}\,r_1, \qquad 0 \le r_1 \le \frac{2}{N_P}$$
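As a worked step, and assuming that the high-SNR outage of S1 behaves as $\big((2^{\frac{N_P}{2}R_1}-1)/\gamma\big)^{N_R}$ and that the logarithm in the rate definition is taken to base 2, substituting $R_1 = r_1\log_2\gamma$ gives the diversity exponent directly (here $\doteq$ denotes equality of the SNR exponent and is notation introduced only for this sketch):

```latex
\begin{align*}
\text{Pout}_{S_1}
  &\doteq \left(\frac{2^{\frac{N_P}{2} r_1 \log_2\gamma}-1}{\gamma}\right)^{N_R}
   = \left(\frac{\gamma^{\frac{N_P}{2} r_1}-1}{\gamma}\right)^{N_R}
   \doteq \gamma^{-N_R\left(1-\frac{N_P}{2} r_1\right)}, \\
d &= N_R\left(1-\frac{N_P}{2}\,r_1\right) = N_R - \frac{N_R N_P}{2}\,r_1 .
\end{align*}
```

For $N_R = 2$ (so that $N_P = 3$) this yields $d = 2 - 3r_1$, which reaches zero at $r_1 = 2/3$.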

One observes that the diversity gain of S1 is only affected
by $r_1$. This is because S2's multiplexing gain $r_2$ can only
affect S1 in the decoding of S2 with SIC. But whether the
relays decode or not, there is no information loss for S1
since AF is used in the latter case. This is another key design
feature of our protocol, i.e., the opportunistic use of AF is
instrumental in avoiding diversity loss. Note also that the protocol
in [23] achieves the same DMT but for a half-duplex single-hop
network, i.e., the high-SNR performance is the same.
Of course, for finite SNR the performance is different. Both
aspects will be examined with simulations.
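The tradeoff derived in this section can be tabulated with a few lines of code. The sketch below simply evaluates $d(r_1) = N_R(1 - N_P r_1/2)$ with $N_P = N_R + 1$; the function names are illustrative and the clipping at zero beyond $r_1 = 2/N_P$ is a convenience added here.

```python
def slots_per_round(n_relays):
    """Number of slots N_P = N_R + 1 used to deliver the two blocks."""
    return n_relays + 1


def max_multiplexing_gain(n_relays):
    """Largest multiplexing gain on the DMT curve, 2 / N_P."""
    return 2.0 / slots_per_round(n_relays)


def diversity_gain(r1, n_relays):
    """DMT curve d(r1) = N_R * (1 - N_P * r1 / 2), clipped at zero."""
    if r1 < 0:
        raise ValueError("multiplexing gain must be non-negative")
    n_p = slots_per_round(n_relays)
    return max(0.0, n_relays * (1.0 - n_p * r1 / 2.0))


if __name__ == "__main__":
    for n_relays in (2, 3, 4):
        r_max = max_multiplexing_gain(n_relays)
        points = [(r, diversity_gain(r, n_relays))
                  for r in (0.0, r_max / 2.0, r_max)]
        print(n_relays, round(r_max, 3), points)
```

For two relays the curve runs from a diversity gain of 2 at $r_1 = 0$ down to zero at $r_1 = 2/3$, in line with the maximum multiplexing gain stated at the beginning of this section.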

V. Optimization in a Multi-Relay Network for
Improved SIC Performance

In this part of our work we explore a scenario where, from
the $N_R$ potential relays that are present in the network,
$N_{RU}$ of them are used simultaneously by the
proposed protocol. Contrary to classic relay selection protocols
that require all relays to overhear the transmitted signal, we
propose to perform relay pre-selection, i.e., to select statically
which set of relays will be used by the protocol for maximizing
the SE of the system. Consequently, the maximum diversity
gain is $N_{RU}$, but the benefit is that by using our model we can
pre-select nodes that can have high performance with SIC.
The remaining $N_R - N_{RU}$ nodes can remain idle and do not
have to participate at all in the communication (e.g., no need for
overhearing or power consumption), contrary to the behavior of
the classic cooperative protocols that we reviewed. To formally
define the problem we address now, we use a single optimization
variable $x_r$ that indicates whether relay $r$ is used. In
vector form we have $\mathbf{x} = (x_r \in \{0,1\} : r \in \{1, 2, \ldots, N_R\})$.
Our objective is to select $N_{RU}$ relays so as to maximize the
spectral efficiency:

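$$\max_{\mathbf{x}} \sum_{r \in \mathcal{V}} x_r \Big( \text{Pout}_{S_2@R_r}(R_1, R_2, \lambda_r, \mu_r)\,R_1 + \text{Pout}_{S_1@R_r}(R_1, R_2, \lambda_r, \mu_r)\,R_2 + \text{Pout}_{none@R_r}(R_1, R_2, \lambda_r, \mu_r)\,(R_1 + R_2) \Big)$$

$$\text{subject to } \sum_{r \in \mathcal{V}} x_r = N_{RU}, \quad x_r \in \{0,1\} \tag{29}$$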

The constraint ensures that only $N_{RU}$ relays are used. The
specific formulation is easily solved in $\mathcal{O}(1)$ since we simply
select the $N_{RU}$ relays whose objective terms have the
highest values. Note that this optimization is executed only
on large time scales and by considering the average channel.
More sophisticated formulations could also include
as additional optimization variables both $R_1$, $R_2$ and the
number of simultaneously used relays $N_{RU}$.
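The selection rule can be sketched as follows. This is an illustrative implementation, not code from this work: the per-relay probabilities are assumed to be supplied by the closed-form outage model, they are interpreted here as the probabilities that only S2 is in outage, that only S1 is in outage, and that neither is, and the numbers in the usage example are hypothetical. The sketch ranks relays by sorting, which costs $O(N_R \log N_R)$ in this form.

```python
def relay_score(p_s2_out, p_s1_out, p_none_out, r1, r2):
    """Per-relay term of the spectral-efficiency objective in (29).

    p_s2_out: probability that only S2 is in outage (S1 is decoded),
    p_s1_out: probability that only S1 is in outage (S2 is decoded),
    p_none_out: probability that both blocks are decoded.
    """
    return p_s2_out * r1 + p_s1_out * r2 + p_none_out * (r1 + r2)


def preselect_relays(scores, n_used):
    """Return the sorted indices of the n_used relays with the largest scores."""
    if not 0 <= n_used <= len(scores):
        raise ValueError("n_used must be between 0 and the number of relays")
    ranked = sorted(range(len(scores)), key=lambda r: scores[r], reverse=True)
    return sorted(ranked[:n_used])


if __name__ == "__main__":
    # Hypothetical (p_s2_out, p_s1_out, p_none_out) triples for four relays.
    candidates = [(0.10, 0.10, 0.30), (0.05, 0.05, 0.80),
                  (0.20, 0.10, 0.50), (0.10, 0.20, 0.10)]
    scores = [relay_score(a, b, c, r1=2.0, r2=2.0) for a, b, c in candidates]
    print(preselect_relays(scores, n_used=2))
```

Because the objective is a sum of independent per-relay terms and the only constraint is on the number of selected relays, picking the largest terms is optimal for this formulation; the ranking only needs to be recomputed when the average channel gains change.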

VI. Performance Evaluation 

Our performance evaluation consists of Monte Carlo simulations
and numerical results, and has four objectives: 1)
underpin the design choices of our protocol, 2) investigate
the performance benefits over related work, 3) validate our
performance model, and 4) demonstrate the use of our system in
a multi-relay network setting. Since the numerical analysis
corresponds to the outage probability that assumes the use
of a capacity-achieving code, we use an LDPC code with
length of 2048 bits in our simulations.

SEPTEMBER 4, 2015, TO APPEAR IN IEEE TRANSACTIONS ON COMMUNICATIONS

Fig. 4. Simulation results for different flavors of the proposed protocol and
different MCSs. a) $\mathbb{E}[|h|^2]=1$ for every channel, and b) $\mathbb{E}[|h_{1,2}|^2]=\mathbb{E}[|h_{2,1}|^2]=0.1$
while for the remaining ones $\mathbb{E}[|h|^2]=1$.

Fig. 5. The complete set of the tested protocols for $R_1, R_2 = 2$ bits/symbol.
a) $\mathbb{E}[|h|^2]=1$ for every channel, and b) $\mathbb{E}[|h_{1,2}|^2]=\mathbb{E}[|h_{2,1}|^2]=0.1$ while for the
remaining ones $\mathbb{E}[|h|^2]=1$.

Fig. 6. Simulation results for the SE of different protocols and different
MCSs. a) $\mathbb{E}[|h|^2]=1$ for every channel, and b) $\mathbb{E}[|h_{1,2}|^2]=\mathbb{E}[|h_{2,1}|^2]=0.1$ while for
the remaining ones $\mathbb{E}[|h|^2]=1$.

We compare our proposed protocol against DSTC protocols that use orthogonal
transmissions fS), El. With DSTC/DF El the sources transmit 
orthogonally and each relay forwards the decoded, while all 
the relays broadcast simultaneously with the same Alamouti 
STC. The DSTC/AF El protocol applies the same Alamouti 
code without decoding the signals. Regarding multi-user (MU) 
communication protocols, we compare with CFNC [24] but
in the same topology as our proposed protocol (2-hop
MAMRN with two sources, two relays, and one destination)
to ensure fairness. The ANC-based protocol reported in [12] is
also evaluated for the same topology. Finally, we also present
the performance of the DF scheme reported in [23], which is
denoted in the figures as DSTC/DF1. The important detail
with this scheme is that it uses a DF protocol from two
sources located one hop away from D. Hence, the comparison
is unfair in favor of DSTC/DF1 and against every other tested
system, but it serves as a useful bound for
the maximum multiplexing gain. With the half-duplex protocol
in [23], the two sources transmit independently over two slots,
while they overhear the transmission from each other. This
means that during a third slot they can both transmit the
block they overheard with an Alamouti code. The maximum
multiplexing gain is 2/3, similar to our protocol.

Protocol Design. First we evaluate the fundamental design
choices made in our protocol in the presence of two relays.
We present two "downgraded" versions of our system, namely
Proposed/AF, which applies forwarding operations without SIC
decoding, and Proposed/DF, which applies SIC and forwards
only the decoded blocks. In Fig. 4(a) we present outage
results for a first scenario with high inter-user interference (i.e.,
$\mathbb{E}[|h_{1,2}|^2]=1$, $\mathbb{E}[|h_{2,1}|^2]=1$). We note that Proposed/AF and
Proposed/DF differ in performance depending
on the MCSs. The reason is that SIC suffers from high inter-user
interference and asymmetric MCSs, and so the AF strategy
is better in this case. For low inter-user interference in Fig. 4(b)
the Proposed/DF scheme is slightly better for asymmetric
MCSs since it exploits the higher number of decoded blocks,
but it is still worse than Proposed/AF after 25 dB. This
makes clearer the need for our protocol, which is robust
to both low and high inter-user interference and to different
MCSs, i.e., it opportunistically leverages decoded signals, but
when it cannot, the signals are still forwarded, thus avoiding
information loss. Hence, a trivial combination of DF or AF
strategies with SIC or with undecoded interfering signals,
respectively, is not enough, because it is not robust to different
combinations of channel gains and MCSs.

Outage comparisons for symmetric/asymmetric links.
In Fig. 5(a) we present outage results for a scenario with
$N_R=N_{RU}=2$ and high inter-user interference (i.e., $\mathbb{E}[|h_{1,2}|^2]=1$,
$\mathbb{E}[|h_{2,1}|^2]=1$). Regarding the protocols that use orthogonality,
like DSTC/AF and DSTC/DF, we observe a similar
behavior in the sense that DF is better than AF, even though
in this case we have orthogonal transmission and there is no
inter-user interference. Since the gains on the second relay
are strong (i.e., $\mathbb{E}[|h_{1,2}|^2]=1$, $\mathbb{E}[|h_{2,1}|^2]=1$) the performance
is very good for both schemes. Regarding the MU-based protocols,
CFNC generally performs very well for symmetric links and
approaches the performance of DSTC/DF and DSTC/DF1.
CFNC extracts the maximum diversity gain since the channel
between each source and the two relays is independent and
strong. However, our proposed protocol performs better below
20 dB while CFNC is better beyond 20 dB. Our scheme
approaches the performance of the orthogonal DSTC/DF.
Overall our protocol is inferior in terms of outage only in the
high-SNR regime and only for a fully symmetric network. As we
will see later, even this is not a problem in terms of SE.

For asymmetric links in Fig. 5(b) (i.e., $\mathbb{E}[|h_{1,2}|^2]=0.1$,
$\mathbb{E}[|h_{2,1}|^2]=0.1$) the performance benefits of our scheme are
even more evident over the complete SNR range. The benefits
are due to the improved SIC decoding performance when
the links are asymmetric. It is also important to see that in this
case our scheme outperforms the DSTC/DF1 protocol simply
because in that protocol the relay executes a 2-hop communication
and only forwards when it successfully decodes. Also
note that CFNC matches the performance of DSTC/DF due
to the lower number of network coding opportunities (since
$\mathbb{E}[|h_{1,2}|^2]=0.1$, a small number of blocks from the second
source are decoded in the same relay).

Fig. 7. Simulation and numerical results for the outage probability of the proposed protocol.

Spectral efficiency comparisons. Now we compare the SE
of our protocol with that of a subset of the previous protocols,
again for $N_R=N_{RU}=2$. Besides CFNC, we also present results
for DSTC/DF1. Transmitting with orthogonal DSTC protocols
is suboptimal in every case, as we already observed from
the outage results, and so we do not include them here. For
the case of high inter-user interference and for $R_1=R_2=2$ we
notice in Fig. 6(a) that even though the outage performance
of our proposed protocol was slightly worse than that of
CFNC after 20 dB, this does not translate into an SE loss. The
reason is that the outage probability is already very low for
our protocol. Even more importantly, in this SNR regime the
higher MCS with $R_1=R_2=3$ bit/symbol (that can be used by an
adaptive protocol) leads to better performance of our protocol
over CFNC even beyond 20 dB. This is important for obtaining
gains with our protocol in every channel gain scenario.

Interesting results are also obtained when we compare
the SE for low inter-user interference (i.e., $\mathbb{E}[|h_{1,2}|^2]=0.1$,
$\mathbb{E}[|h_{2,1}|^2]=0.1$) in Fig. 6(b). We notice that our proposed
protocol can achieve a spectral efficiency of 1.2 bits/symbol
for a transmit SNR of 11 dB while CFNC is considerably
lower. This particular average SNR is important since it is
the borderline case where a higher-order MCS is needed
for better performance. The gains are even more pronounced
for the higher-order MCS of $R_1=R_2=3$ bit/symbol. Now
DSTC/DF1 [23] achieves lower gain simply because there is
only one good link from the source to the destination, while
for the link from the second source to the destination it is
$\mathbb{E}[|h_{2,d}|^2]=0.1$ in accordance with our setup in this paragraph.
We can also see that our scheme obtains the same spectral
efficiency in the high-SNR regime. This result
verifies the DMT analysis that predicts a multiplexing gain of 2/3
at high SNR. It is also important to see that our scheme is
better than DSTC/DF1 [23] even with the
disadvantage of two hops, since the latter protocol uses only DF.
This illustrates again the important benefits of our protocol.



(a) Diversity gains with 3 and 4 relays used simultaneously. (b) The relay pre-selection algorithm for different numbers of potential relays.

Fig. 8. Outage probability for multi-relay networks.


Model validation. We present numerical and simulation results
for symmetric rate requirements of $R_1=R_2=1$ bit/symbol,
$N_R=N_{RU}=2$, and different channel conditions for the compound
MAC in the first hop in Fig. 7(a) and Fig. 7(b). The
parameters of the second hop are $\mathbb{E}[|f_1|^2]=\mathbb{E}[|f_2|^2]=1.0$. In
Fig. 7(a) we set $\mathbb{E}[|h_{2,2}|^2]=0.1$. It is important to note that the
performance of our protocol is dictated by the average channel
gain from the source S1 to the second relay ($\mathbb{E}[|h_{1,2}|^2]$).
This is because when $\mathbb{E}[|h_{1,2}|^2]=1.0$ our protocol can decode
a strong signal from S1 that arrives at this second relay,
and so it can apply an STC for extracting a diversity gain
of 2. The same is true also in Fig. 7(b), where we observe
that the performance remains relatively unaffected even when
the source S2 interferes significantly at the second relay
($\mathbb{E}[|h_{2,2}|^2]=1.0$ in this figure). Again the reason is that SIC
behaves very well, since a stronger signal from S2 towards
RS2 ($\mathbb{E}[|h_{2,2}|^2]=1.0$) can be cancelled more effectively. Our
numerical results for this case follow closely the simulation.
Numerical and simulation results agree very well because our
analysis also considers the impact of SIC error propagation.
Even though not clearly visible, we observed that the underestimation
of the simulation in the low-SNR regime occurs because
a fixed-length LDPC code was used. The results for different
MCSs are shown in Fig. 7(c), and they illustrate the validity of
our model with respect to the different and asymmetric MCSs.

Multi-relay diversity. Multi-relay diversity is examined in
Fig. 8(a) for fully symmetric links. Now we evaluate a number
of used relays equal to $N_R=N_{RU}=3$ and $N_R=N_{RU}=4$. A very
important observation is that for higher $N_{RU}$ the borderline
case where CFNC outperforms our proposed protocol moves
to a higher SNR, unlike the case of $N_{RU}=2$ where this
threshold was at 20 dB. Also, both MU protocols outperform
the classic orthogonal DSTC/AF, which is also known to achieve
full diversity.

Multi-relay networks and pre-selection. In the next scenario
we have a network where $N_R$ relays are present but
only $N_{RU}$ are pre-selected statically for simultaneous use. Here
$R_1=R_2=2$ bit/symbol. The relays are spread randomly and uniformly
in an area between the sources and the destination. In
this case the distance between two nodes $i$, $j$, dist($i$, $j$), is normalized
between 0 and 1, while we introduce a path loss model
for the average channel gains, i.e., E[|ft,ijp]=l/dist(i, j)^. In 
Fig. 8(b) the performance of the proposed system improves
for higher $N_R$ and saturates for $N_R=15, 20$. Even though there
is no diversity gain from the presence of additional relays,
i.e., higher $N_R$, since $N_{RU}$ is fixed to 2, better performance is
achieved overall. Note that a classic orthogonal scheme that
similarly uses $N_{RU}=2$ relays cannot exploit the presence of
more than 5 relays (three curves coincide).

VII. Conclusions 

In this paper we introduced a novel cooperative protocol for
a two-source multi-relay network that is based on opportunistic
SIC and AF/DF relay operations. Analytical and simulation
performance results show that significant diversity and multiplexing
benefits can be observed over orthogonal DSTC
protocols, and also multi-user protocols. We also introduced
the static relay pre-selection concept enabled by our outage
model that allows the exploitation of nodes as relays in dense
networks with no practical overhead. Future work will investigate
first the presence of more sources. Other avenues for
future work include the optimization of the AF/DF functions,
the study of the implications on the MAC protocol design,
and the use of our idea in specific LTE and small-cell network
topologies.